Abstract

The traditional models of distributed computing focus mainly on networks of computer-like
devices that can exchange large messages with their neighbors and perform arbitrary local
computations. Recently, there has been a trend to apply distributed computing methods to networks of
sub-microprocessor devices, e.g., biological cellular networks or networks of nano-devices. However,
the suitability of the traditional distributed computing models to these types of networks
is questionable: do tiny bio/nano nodes "compute" and/or "communicate" essentially the same way
as a computer? In this paper, we introduce a new model that depicts a network of randomized
finite state machines operating in an asynchronous environment. Although the computation
and communication capabilities of each individual device in the new model are, by design, much
weaker than those of a computer, we show that some of the most important and extensively
studied distributed computing problems can still be solved efficiently.



1 Introduction 



Networks are at the core of many scientific areas, be it social sciences (where networks for instance 
model human relations), logistics (e.g. traffic), or electrical engineering (e.g. circuits). Distributed 
computing is the area that studies the power and limitations of distributed algorithms and
computation in networks. Due to the major role that the Internet plays today, models targeted at
understanding the fundamental properties of networks focus mainly on "Internet-capable" devices.
The standard model in distributed computing is the so-called message passing model, where nodes
may exchange large messages with their neighbors and perform arbitrary local computations.

Some networks though, are not truthfully represented by the classical message passing model. 
For example, wireless networks such as ad hoc or sensor networks, whose research has blossomed in 
the last decade, require some adaptations of the message passing model so that it meets the limited 
capabilities of the underlying wireless devices more precisely. More recently, there has been a trend to
apply distributed computing methods, and in particular, the message passing model, to networks
of sub-microprocessor devices, for instance networks of biological cells or nano-scale mechanical
devices. However, the suitability of the message passing model to these types of networks is far
from being certain: do tiny bio/nano nodes "compute" and/or "communicate" essentially the same way
as a computer? Since such nodes will be fundamentally more limited than silicon-based devices, we
believe that there is a need for a network model where nodes are, by design, below the computation
and communication capabilities of Turing machines.

Networked Finite State Machines. In this paper, we take a radically different approach: 
Instead of imposing additional restrictions on the existing models for networks of computer-like 
devices, we introduce an entirely new model, referred to as networked finite state machines (nFSM), 
that depicts a network of randomized finite state machines progressing in asynchronous steps (refer 
to Section 2 for a formal description). Under the nFSM model, nodes communicate by transmitting
messages belonging to some finite communication alphabet $\Sigma$ such that a message $\sigma \in \Sigma$ transmitted
by node u is delivered to its neighbors (the same $\sigma$ to all neighbors) in an asynchronous fashion;
each neighbor v of u has a port corresponding to u in which the last message delivered from u is
stored.

The access of node v to its ports is limited: each state q in the state set Q of the FSM is
associated with some query letter $\sigma = \lambda(q) \in \Sigma$; if node v resides in state q at some step of the
execution, then the next state and the message transmitted by v at this step are determined by q and
by the number $\sharp(\sigma)$ of occurrences of $\sigma$ in v's ports. The crux of the model is that $\sharp(\sigma)$ is calculated
according to the one-two-many^1 principle: the node can only count up to some predetermined
bounding parameter $b \in \mathbb{Z}_{>0}$ and any value of $\sharp(\sigma)$ larger than b cannot be distinguished from b.

^1 The one-two-many theory states that some small isolated cultures (e.g., the Pirahã tribe of the Amazon [20])
did not develop a counting system that goes beyond 2. This is reflected in their languages that include words for
"1", "2", and "many" that stands for any number larger than 2.

In particular, the nFSM model satisfies the following model requirements which, we believe, make
it more applicable to the study of networks consisting of weaker devices such as those mentioned
above.

(M1) The model is applicable to arbitrary network topologies.

(M2) All nodes run the same protocol executed by a (randomized) FSM. 

(M3) The network operates within an asynchronous environment, with node activation patterns 
independent of message delivery patterns. 

(M4) All features of the FSM (specifically, the state set Q, message alphabet $\Sigma$, and bounding
parameter b) are of constant size independent of any parameter of the network (including the degree
of the node executing the FSM).

The last requirement is perhaps the most interesting one as it implies that a node cannot perform 
any calculation that involves numbers beyond some predetermined constant. This comes in contrast 
to many distributed algorithms operating under the message passing model that strongly rely on 
the ability of a node to perform such calculations (e.g., count up to some parameter of the network 
or a function thereof). 

Results. Our investigation of the new model begins by implementing an nFSM synchronizer that
practically allows the algorithm designer to assume a synchronous environment (Section 3). Then,
we show that the computational power of a network operating under the nFSM model is essentially
equivalent to that of a randomized Turing machine with linear space bound (cf. linear bounded
automaton). In comparison, the computational power of a network operating under the message
passing model is trivially equivalent to that of a (general) Turing machine; therefore, there exist
distributed problems that can be solved under the message passing model in constant time but
cannot be solved under the nFSM model at all (Section 6).

Nevertheless, we show that arguably the most important and extensively studied problems in
distributed computing admit efficient algorithms (namely, with run-time polylogarithmic in the
number of nodes) operating under the nFSM model. Specifically, we develop such algorithms
for computing a maximal independent set (MIS) in arbitrary graphs (Section 4) and for 3-coloring
of (undirected) trees (Section 5). We also develop an efficient algorithm that computes a maximal
matching in arbitrary graphs, but this requires a small unavoidable modification of the nFSM model
that goes beyond the scope of the current version of the paper.

Related Work. As mentioned above, the message passing model is the gold standard when it 
comes to understanding distributed algorithms. Several variants exist for this model, differing 
mainly in the bounds imposed on the message size and the level of synchronization. Perhaps 
the most popular message passing variants are the fully synchronous local and congest models 
[26\ [3H [36], assuming that in each round, a node can send messages to its neighbors (different 
messages to different neighbors), receive and interpret the messages sent to it from its neighbors,
and perform an arbitrary local computation^2 determining, in particular, the messages sent in the
next round. The difference between the two variants is cast in the size of the communicated
messages: the local model does not impose any restrictions on the message size, hence it can be 
used for the purpose of establishing general lower bounds, whereas the congest model is more 
information-theoretic, with a (typically logarithmic) bound on the message size. Indeed, most 
theoretical literature dealing with distributed algorithms relies on one of these two models. 

As the congest model still allows for sending different messages to different neighbors in each 
round, it was too powerful for many settings. Instead, with the proliferation of wireless networks, 
new more restrictive message passing models appeared such as the radio network model [13]. In 
radio networks, nodes still operate in synchronous rounds, where in each round a node may choose 
to transmit a message or stay silent. A transmitted message is received by all neighbors in the 
network if the neighbors do not experience interference by concurrently transmitting nodes in their 
own neighborhood. There are several variants, e.g. whether nodes have collision detection, or not. 

Since the radio network model is still too powerful for some wireless settings, more restrictive 
models were suggested. One such example is the beeping model [17, 18], where in each round a
node can either beep or stay silent, and a silent node can only distinguish between the case in
which no node in its neighborhood beeps and the case in which at least one node beeps. Efficient
algorithms and lower bounds for the MIS problem under the beeping model were developed by Afek
et al. [2, 1]. Note that the beeping model resembles our nFSM model in the sense that the "beeping
rule" can be viewed as counting under the one-two-many principle with bounding parameter b = 1.
However, it is much stronger in other respects: (i) the beeping model assumes synchronous
communication and does not seem to have a natural asynchronous variant, thus it does not satisfy
requirement (M3); and (ii) the local computation is performed by a Turing machine whose memory
is allowed to grow with the network (this is crucial for the algorithms of Afek et al. [2, 1]), thus it
does not satisfy requirements (M2) and (M4).

Our nFSM model is a generalization of the extensively studied cellular automaton model [301 
[TSl [38] that captures a network of FSMs, arranged in a grid topology (some other highly regular 
topologies were also considered), where the transition of each node depends on its current state 
and the states of its neighbors. Still, the nFSM model differs from the cellular automaton model in 
many aspects; in particular, the latter model is not applicable to non-regular network topologies,
in contrast to requirement (M1), and for the most part, it also does not support asynchronous
environments (at least not as asynchrony is understood in the current paper), in contrast to requirement
(M3).

Another model that resembles the nFSM model is that of communicating automata [12]. This
model also assumes that each node in the network operates an FSM in an asynchronous manner;
however, the steps of the FSMs are message driven: for each state q of node v and for each message
m that node v may receive from an adjacent node u while residing in state q, the transition
function of v should have an entry characterized by the 3-tuple (q, u, m) that determines its next
move. As such, different nodes would typically operate different FSMs, hence the model does
not satisfy requirement (M2), and more importantly, the size of the FSM operated by node v
inherently depends on the degree of v, hence it does not satisfy requirement (M4). Moreover, the
node activation pattern is driven by the incoming messages, so it also does not satisfy requirement
(M3).

^2 It is important to point out that even though the local and congest models allow for arbitrary local computations,
the existing literature hardly ever assumes anything that cannot be computed in time polynomial in the size of the
information received thus far; the rare exceptions are typically clearly mentioned in the text.

Applicability to Biological Cellular Networks. Regardless of the theoretical interest in
implementing efficient algorithms using weaker assumptions, we believe that our new model and
results should be appealing to anyone interested in understanding the computational aspects of
biological cellular networks. A basic dogma in biology (see, e.g., [33]) states that all cells
communicate and that they do so by emitting special kinds of proteins (e.g., cytokines and chemokines
in the immune system) that can be recognized by designated receptors, thus enabling neighboring
cells to distinguish between different concentration levels of these proteins, which, after a signaling
cascade, leads to different gene expression.

Translated to the language of the nFSM model, the emitted proteins correspond to the letters 
of the communication alphabet, where the actual emission corresponds to transmitting a letter, and 
the ability of a cell to distinguish between different concentration levels of these proteins corresponds 
to the manner in which the nodes in our model interpret the content of their ports. Using an FSM 
as the underlying computational model of each node seems to be the right choice especially in the 
biological setting as demonstrated by Benenson et al. [11] who showed that essentially any FSM 
can be implemented by enzymes found in cells' nuclei. One may wonder if the specific problems 
studied in the current paper have any relevance to biological cellular networks. Indeed, Afek et 
al. [2] discovered that a biological process that occurs during the development of the nervous system 
of a fly is in fact equivalent to solving the MIS problem. 

2 Model 

Throughout, we assume a network represented by a finite undirected graph $G = (V, E)$. Under the
networked finite state machines (nFSM) model, each node $v \in V$ runs a protocol depicted by the
8-tuple

$$\Pi = \langle Q, Q_I, Q_O, \Sigma, \sigma_0, b, \lambda, \delta \rangle ,$$

where

• $Q$ is a finite set of states;

• $Q_I \subseteq Q$ is the subset of input states;

• $Q_O \subseteq Q$ is the subset of output states;

• $\Sigma$ is a finite communication alphabet;

• $\sigma_0 \in \Sigma$ is the initial letter;

• $b \in \mathbb{Z}_{>0}$ is a bounding parameter; let $B = \{0, 1, \ldots, b - 1, {\geq}b\}$ be a set of $b + 1$ distinguishable
symbols;

• $\lambda : Q \to \Sigma$ assigns a query letter $\sigma \in \Sigma$ to every state $q \in Q$; and

• $\delta : Q \times B \to 2^{Q \times (\Sigma \cup \{\varepsilon\})}$ is the transition function.

It is important to point out that protocol $\Pi$ is oblivious to the graph G. In fact, the number of
states in Q, the size of the alphabet $\Sigma$, and the bounding parameter b are all assumed to be universal
constants, independent of any parameter of the graph G. In particular, the protocol executed by
node $v \in V$ does not depend on the degree of v in G. We now turn to describe the semantics of
the nFSM model.

Communication. Node v communicates with its adjacent nodes in G by transmitting messages.
A transmitted message consists of a single letter $\sigma \in \Sigma$ and it is assumed that this letter is delivered
to all neighbors u of v. Each neighbor u has a port $\psi_u(v)$ (a different port for every adjacent node
v) in which the last message $\sigma$ received from v is stored. At the beginning of the execution, all
ports store the initial letter $\sigma_0$. It will be convenient to consider the case in which v does not
transmit any message (and hence does not affect the corresponding ports of the adjacent nodes) as
a transmission of the special empty symbol $\varepsilon$.

Execution. The execution of node v progresses in discrete steps indexed by the positive integers.
At each step $t \in \mathbb{Z}_{>0}$, v resides in some state $q \in Q$. Let $\lambda(q) = \sigma \in \Sigma$ be the query letter that $\lambda$
assigns to state q and let $\sharp(\sigma)$ be the number of occurrences of $\sigma$ in v's ports in step t. Then, the
pair $(q', \sigma')$ of state $q' \in Q$ in which v resides in step $t + 1$ and message $\sigma' \in \Sigma \cup \{\varepsilon\}$ transmitted by
v in step t (recall that $\varepsilon$ indicates that no message is transmitted) is chosen uniformly at random
(and independently of all other random choices) among the pairs in

$$\delta\left(q, f_b(\sharp(\sigma))\right) \subseteq Q \times (\Sigma \cup \{\varepsilon\}) ,$$

where $f_b : \mathbb{Z}_{\geq 0} \to B$ is defined as

$$f_b(x) = \begin{cases} x & \text{if } 0 \leq x \leq b - 1; \\ {\geq}b & \text{otherwise.} \end{cases}$$

Informally, this can be thought of as if v queries its ports for occurrences of $\sigma$ and "observes" the
exact value of $\sharp(\sigma)$ as long as it is smaller than the bounding parameter b; otherwise, v merely
"observes" that $\sharp(\sigma) \geq b$, which is indicated by the symbol ${\geq}b$.
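As a minimal illustration of the step semantics above, the following Python sketch implements $f_b$ and a single application of the transition function. The dictionary-based encoding of the ports, of `query_letter`, and of `delta`, as well as the names `GE_B`, `EPS`, `f_b`, and `step`, are assumptions made for this example only; they are not part of the model's definition.

```python
import random

GE_B = ">=b"   # hypothetical stand-in for the symbol ">= b"
EPS = None     # hypothetical stand-in for the empty symbol epsilon


def f_b(x, b):
    """One-two-many counting: exact for 0 <= x <= b - 1, otherwise the symbol '>= b'."""
    if x < 0:
        raise ValueError("the number of occurrences cannot be negative")
    return x if x <= b - 1 else GE_B


def step(state, ports, query_letter, delta, b, rng=random):
    """Perform one step of a node.

    ports:        dict mapping each neighbor to the last letter delivered from it
    query_letter: dict mapping each state q to its query letter lambda(q)
    delta:        dict mapping (q, symbol of B) to a list of (next_state, message) pairs
    Returns a pair (next_state, message) chosen uniformly at random.
    """
    sigma = query_letter[state]
    occurrences = sum(1 for letter in ports.values() if letter == sigma)
    options = delta[(state, f_b(occurrences, b))]
    return rng.choice(list(options))
```

Note that the node never sees which neighbors hold the query letter, only the bounded count, which is what keeps the description of `delta` independent of the node's degree.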
Input and Output. Initially (in step 1), each node resides in one of the input states in $Q_I$.
The choice of the initial state of node v reflects the input passed to v at the beginning of the
execution. This allows our model to cope with distributed problems in which different nodes get
different input symbols. When dealing with problems in which the nodes do not get any initial
input (such as the graph theoretic problems addressed in this paper), we shall assume that $Q_I$
contains a single initial state.

We say that the (global) execution of the protocol is in an output configuration if all nodes
reside in output states of $Q_O$. If this is the case, then the output of node $v \in V$ is determined by
the output state $q \in Q_O$ in which v resides.

Asynchrony. The nodes are assumed to operate in an asynchronous environment. This asynchrony
has two facets. First, for the sake of convenience, we assume that the actual application
of the transition function in each step $t \in \mathbb{Z}_{>0}$ of node $v \in V$ is instantaneous (namely, lasts zero
time) and occurs at the end of the step;^3 the length of step t of node v, denoted $L_{v,t}$, is defined as
the time difference between the application of the transition function in step $t - 1$ and that of step
t. It is assumed that $L_{v,t}$ is finite, but apart from that, we do not make any further assumptions
on this length, that is, the step length $L_{v,t}$ is determined by the adversary independently of all
other step lengths $L_{v',t'}$. In particular, we do not assume any synchronization between the steps of
different nodes whatsoever.

Another facet of the asynchronous environment is that a message transmitted by node v in
step t (if such a message is transmitted) is assumed to reach the port $\psi_u(v)$ of an adjacent node
u after a finite time delay, denoted $D_{v,t,u}$. We assume that if v transmits message $\sigma_1 \in \Sigma$ in step
$t_1$ and message $\sigma_2 \in \Sigma$ in step $t_2 > t_1$, then $\sigma_1$ reaches u before $\sigma_2$ does. Apart from this "FIFO"
assumption, we do not make any other assumptions on the delays $D_{v,t,u}$. In particular, this means
that under certain circumstances, the adversary may overwrite message $\sigma_1$ with message $\sigma_2$ in port
$\psi_u(v)$ of u so that u will never "know" that message $\sigma_1$ was transmitted.^4

Consequently, a policy of the adversary is captured by: (1) the length $L_{v,t}$ of step t of node v
for every $v \in V$ and $t \in \mathbb{Z}_{>0}$; and (2) the delay $D_{v,t,u}$ of the delivery of the transmission of node
v in step t to an adjacent node u for every $v \in V$, $t \in \mathbb{Z}_{>0}$, and $u \in N(v)$.^5 Assuming that the
adversary is oblivious to the random coin tosses of the nodes, an adversarial policy is depicted by
infinite sequences of $L_{v,t}$ and $D_{v,t,u}$ parameters.
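The overwriting effect permitted by the FIFO assumption can be made concrete with a small Python sketch. It models a single port $\psi_u(v)$ as holding only the last delivered letter; the function names and the representation of deliveries as `(arrival_time, letter)` pairs are assumptions made for illustration.

```python
def port_content_at(read_time, deliveries, initial_letter):
    """Return the letter stored in a port at time read_time.

    deliveries: list of (arrival_time, letter) pairs, listed in the order in
    which the sender transmitted them; FIFO requires strictly increasing
    arrival times along this list.
    """
    arrival_times = [arrival for arrival, _ in deliveries]
    if any(a >= b for a, b in zip(arrival_times, arrival_times[1:])):
        raise ValueError("deliveries violate the FIFO assumption")
    content = initial_letter
    for arrival_time, letter in deliveries:
        if arrival_time <= read_time:
            content = letter  # a later delivery overwrites the earlier one
    return content


def observed_letters(read_times, deliveries, initial_letter):
    """Letters that the receiving node observes when it reads the port at the given times."""
    return [port_content_at(t, deliveries, initial_letter) for t in read_times]
```

For instance, if two letters arrive at times 1.0 and 1.5 while the receiver applies its transition function only at times 0.5 and 2.0, the first letter is never observed.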

^3 This assumption can be lifted at the cost of a more complicated definition of the adversarial policy described
soon.

^4 Often, much stronger assumptions are made in the literature. For example, a common assumption for
asynchronous environments is that the port of node u corresponding to the adjacent node v is implemented by a buffer
so that messages cannot be "lost". We do not make any such assumption for our nFSM model.

^5 We use the standard notation $N(v)$ for the neighborhood of node v in G, namely, the subset of nodes adjacent
to v.
For further information on asynchronous environments, we point the reader to one of the
standard textbooks [3HI28].

Correctness and Run-Time Measures. A protocol $\Pi$ for problem P is said to be correct under
the nFSM model if for every instance of P and for every adversarial policy, $\Pi$ reaches an output
configuration within finite time with probability 1, and for every output configuration reached by $\Pi$
with positive probability, the output of the nodes is a valid solution to P. Given a correct protocol
$\Pi$, the complexity measure that interests us in the current paper is the run-time of $\Pi$ defined as
follows.

Consider some instance $\mathcal{I}$ of problem P. Given an adversarial policy $\mathcal{A}$ and a sequence (actually
an n-tuple of sequences) $\mathcal{R}$ of random coin tosses that lead to an output configuration within finite
time, the run-time $T_\Pi(\mathcal{I}, \mathcal{A}, \mathcal{R})$ of $\Pi$ on $\mathcal{I}$ with respect to $\mathcal{A}$ and $\mathcal{R}$ is defined as the (possibly
fractional) number of time units^6 that pass from the beginning of the execution until the first time
the protocol reaches an output configuration, where a time unit is defined to be the maximum
among all step length parameters $L_{v,t}$ and delivery delay parameters $D_{v,t,u}$ appearing in $\mathcal{A}$ before
the output configuration is reached. Let $T_\Pi(\mathcal{I}, \mathcal{A})$ denote the random variable that depicts the
run-time of $\Pi$ on $\mathcal{I}$ with respect to $\mathcal{A}$. Following the standard procedure in this regard, we say that
the run-time of a correct protocol $\Pi$ for problem P is $f(n)$ if for every n-node instance $\mathcal{I}$ of P and
for every adversarial policy $\mathcal{A}$, it holds that $T_\Pi(\mathcal{I}, \mathcal{A})$ is at most $f(n)$ in expectation and with high
probability. The protocol is said to be efficient if its run-time is polylogarithmic in the size of the
network (cf. [26]).
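The normalization by time units can be spelled out in a short Python sketch. It assumes that the elapsed (absolute) time until the first output configuration and the finitely many step lengths and delivery delays used by the adversary before that point are given as plain numbers; the function name `run_time` and its parameters are hypothetical.

```python
def run_time(elapsed_time, step_lengths, delivery_delays):
    """Run-time in time units for one execution.

    elapsed_time:    absolute time until the first output configuration
    step_lengths:    the L_{v,t} values appearing before that configuration
    delivery_delays: the D_{v,t,u} values appearing before that configuration
    A time unit is the maximum over all of these parameters.
    """
    time_unit = max(list(step_lengths) + list(delivery_delays))
    return elapsed_time / time_unit
```

Under this normalization, uniformly slowing down all steps and deliveries by the same factor leaves the run-time unchanged, which is why the measure is meaningful even though the nodes themselves cannot measure time.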

3 Convenient Transformations 

In this section, we show that the nFSM protocol designer may, in fact, assume a slightly more
"user-friendly" environment than the one described in Section 2. This is based on the design of
black-box compilers transforming a protocol that makes strong assumptions on the environment
into one that does not make any such assumptions. Specifically, the assumptions that can be lifted
that way are synchrony (Section 3.1) and multiple-letter queries (Section 3.2).

3.1 Implementing a Synchronizer 

As described in Section 2, the nFSM model assumes an asynchronous environment. Nevertheless,
it will be convenient to extend the nFSM model to synchronous environments. One natural such
extension augments the model described in Section 2 with the following two synchronization
properties for every two adjacent nodes $u, v \in V$ and for every $t \in \mathbb{Z}_{>0}$:

(S1) when node u is in step t, node v is in step $t - 1$, t, or $t + 1$; and

(S2) at the end of step $t + 1$ of u, port $\psi_u(v)$ stores the message transmitted by v in step t of v's
execution (or the last message transmitted by v prior to step t if v does not transmit any message
in step t).

^6 Note that time units are defined solely for the purpose of the analysis. Under an asynchronous environment, the
nodes have no notion of time and in particular, they cannot measure a single time unit.

An environment in which properties (S1) and (S2) are guaranteed to hold is called a locally
synchronous environment. Local-only communication can never achieve global synchrony; however,
research in the message passing model has shown that local synchrony is often sufficient to provide 
efficient algorithms (HO [5]. To distinguish a protocol assumed to operate in a locally synchronous 
environment from those making no such assumptions, we shall often refer to the execution steps of 
the former as rounds (cf. fully synchronized protocols). Our goal in this section is to establish the 
following theorem. 

Theorem 3.1. Every nFSM protocol $\Pi = \langle Q, Q_I, Q_O, \Sigma, \sigma_0, b, \lambda, \delta \rangle$ designed to operate in a locally
synchronous environment can be simulated in an asynchronous environment by a protocol $\hat{\Pi}$ at the
cost of a constant multiplicative run-time overhead.

The procedure in charge of the simulation promised in Theorem 3.1 is referred to as a
synchronizer [3]. The remainder of Section 3.1 is dedicated to the design (and analysis) of a synchronizer
for the nFSM model.

Overview. Round $t \in \mathbb{Z}_{>0}$ of node $v \in V$ under $\Pi$ is simulated by $O(1)$ contiguous steps under
$\hat{\Pi}$; the collection of these steps is referred to as v's simulation phase of round t. Protocol $\hat{\Pi}$ is
designed so that v maintains the value of t mod 3, referred to as the trit (trinary digit) of round t,
which is also encoded in the message transmitted by v at the end of round t.^7 The main principle
behind our synchronizer is that node v will not move to the simulation phase of round $t + 1$ while
its ports still contain messages sent in a round whose trit is $t - 1$ mod 3.

Under $\Pi$, the decisions made by node v at round t should be based on the messages transmitted
by all neighbors u of v at round $t - 1$. However, during v's simulation phase of round t, port $\psi_v(u)$
may contain messages transmitted at round $t - 1$ or at round t under $\Pi$. The latter case is
problematic since the message transmitted by u in the simulation phase of round $t - 1$ is overwritten by
that transmitted in the simulation phase of round t. To avoid this obstacle, a message transmitted
by node u under $\hat{\Pi}$ at the end of the simulation phase of round t also encodes the message that u
transmitted under $\Pi$ at round $t - 1$.

So, if v resides in a state whose query letter is $\sigma \in \Sigma$ in round t under $\Pi$, then under $\hat{\Pi}$, v should
query for all $\hat{\Sigma}$-letters encoding a transmission of $\sigma$ at round $t - 1$. Since there are several such
letters, a carefully designed feature should be used so that $\hat{\Pi}$ accounts for their combined number.

^7 Note that maintaining the value of t mod 2 is insufficient for the sake of reaching synchronization.

Protocol $\hat{\Pi}$. Let $\hat{\Pi} = \langle \hat{Q}, \hat{Q}_I, \hat{Q}_O, \hat{\Sigma}, \hat{\sigma}_0, b, \hat{\lambda}, \hat{\delta} \rangle$.

Consider node $v \in V$ and round $t \in \mathbb{Z}_{>0}$. As the name implies, node v's simulation phase of round
t under $\hat{\Pi}$, denoted $\phi_v(t)$, corresponds to round t of $\Pi$. Protocol $\hat{\Pi}$ is designed so that at every
step in $\phi_v(t)$ other than the last one, v does not transmit any message (indicated by transmitting
$\varepsilon$), and at the last step of the simulation phase, v always transmits some message $\hat{\sigma} \in \hat{\Sigma}$, denoted
$M_v(t)$.

The alphabet $\hat{\Sigma}$ is defined to be

$$\hat{\Sigma} = (\Sigma \cup \{\varepsilon\}) \times (\Sigma \cup \{\varepsilon\}) \times \{0, 1, 2\} .$$

The semantics of the message $M_v(t) = (\sigma, \sigma', j)$ sent by node v at the last step of the simulation
phase $\phi_v(t)$ is that: v transmits $\sigma \in \Sigma \cup \{\varepsilon\}$ at round $t - 1$ under $\Pi$; v transmits $\sigma' \in \Sigma \cup \{\varepsilon\}$ at
round t under $\Pi$; and $j = t \bmod 3$. Following that logic, we set $\hat{\sigma}_0 = (\varepsilon, \sigma_0, 0)$.
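The structure of the simulating alphabet can be checked with a few lines of Python. This is only a sketch: letters are encoded as tuples, `EPS` stands in for the empty symbol, and the function names are chosen here for illustration.

```python
from itertools import product

EPS = None  # hypothetical stand-in for the empty symbol epsilon


def simulated_alphabet(sigma):
    """Build (Sigma + {eps}) x (Sigma + {eps}) x {0, 1, 2} as a set of triples."""
    letters = list(sigma) + [EPS]
    return set(product(letters, letters, range(3)))


def message(prev_letter, cur_letter, t):
    """Encode M_v(t): the letter sent at round t - 1, the letter sent at round t, and the trit t mod 3."""
    return (prev_letter, cur_letter, t % 3)


def initial_letter(sigma0):
    """The initial letter (eps, sigma_0, 0) of the simulating protocol."""
    return (EPS, sigma0, 0)
```

In this encoding the simulating alphabet has exactly $3(|\Sigma| + 1)^2$ letters, a constant whenever $|\Sigma|$ is a constant.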

The state set $\hat{Q}$ of $\hat{\Pi}$ is defined to be

$$\hat{Q} = \left( \bigcup_{q \in Q} (P_q \cup S_q) \right) \times \{0, 1, 2\} ,$$

where $P_q \times \{j\}$ and $S_q \times \{j\}$, $q \in Q$, $j \in \{0, 1, 2\}$, are referred to as the pausing and simulating
features, respectively, whose role will be clarified soon. Suppose that v resides in state $q \in Q$ in
step t under $\Pi$ and that $j = t \bmod 3$. Then, throughout $\phi_v(t)$, node v resides in some state in
$(P_q \cup S_q) \times \{j\}$. In particular, in the first steps of the simulation phase, v resides in states of the
pausing feature $P_q \times \{j\}$, and then at some stage it switches to the simulating feature $S_q \times \{j\}$ and
remains in its states until the end of the simulation phase.



The Pausing Feature. For the simulation phase of round t, we denote the letters in $(\Sigma \cup \{\varepsilon\}) \times
(\Sigma \cup \{\varepsilon\}) \times \{j - 2\}$ as dirty and the letters in $(\Sigma \cup \{\varepsilon\}) \times (\Sigma \cup \{\varepsilon\}) \times \{j - 1, j\}$ as clean.^8 The
purpose of the pausing feature $P_q \times \{j\}$ is to pause the execution of v until its ports do not contain
any dirty letter. This is carried out by including in $P_q \times \{j\}$ a state $p_{\sigma,\sigma'}$ for every $\sigma, \sigma' \in \Sigma \cup \{\varepsilon\}$;
the query letter of $p_{\sigma,\sigma'}$ is (the dirty letter) $\hat{\lambda}(p_{\sigma,\sigma'}) = (\sigma, \sigma', j - 2)$ and the transition function $\hat{\delta}$ is
designed so that v moves to the next (according to some fixed order) state in the feature $P_q \times \{j\}$
if and only if there are no ports storing the query letter.
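The dirty/clean classification amounts to a comparison of trits. The Python sketch below tests the exit condition of the pausing feature in a single pass over the ports; this is a simplification made for illustration, since the protocol itself queries one dirty letter per state. Letters are assumed to be triples whose last component is the trit, and the function names are hypothetical.

```python
def is_dirty(letter, t):
    """A letter is dirty for the simulation phase of round t if its trit is (t - 2) mod 3."""
    return letter[2] == (t - 2) % 3


def pausing_done(ports, t):
    """True if no port of the node stores a dirty letter.

    ports: dict mapping each neighbor to the triple currently stored in its port.
    """
    return not any(is_dirty(letter, t) for letter in ports.values())
```

For example, in the simulation phase of round 5 the trit of a dirty letter is 0, while letters with trit 1 (sent in round 4) or trit 2 (sent in round 5) are clean.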

We argue that the pausing feature guarantees synchronization property (S1). For the sake of
the analysis, it is convenient to assume the existence of a fully synchronous simulation phase of
a virtual round 0; upon completion of this simulation phase (at the beginning of the execution),
every node $v \in V$ transmits the message $M_v(0) = \hat{\sigma}_0$. We are now ready to establish the following
lemma.

^8 Throughout this section, arithmetic involving the parameter j is done modulo 3.

Lemma 3.2. For every $t \in \mathbb{Z}_{>0}$, $v \in V$, and $u \in N(v)$, when v completes the pausing feature of
$\phi_v(t)$, port $\psi_v(u)$ stores either $M_u(t - 1)$ or $M_u(t)$.

Proof. By induction on t. The base case of round $t = 1$ holds by our assumption that $\phi_v(0)$
and $\phi_u(0)$ are fully synchronous. Assume by induction that the assertion holds for round $t - 1$.
Applying the inductive hypothesis to both u and v, we conclude that (1) when v completes the
pausing feature of $\phi_v(t - 1)$, port $\psi_v(u)$ stores either $M_u(t - 2)$ or $M_u(t - 1)$; and (2) when u
completes the pausing feature of $\phi_u(t - 1)$, port $\psi_u(v)$ stores either $M_v(t - 2)$ or $M_v(t - 1)$.

Let $\tau_u$ and $\tau_v$ denote the times at which u and v complete the pausing feature of $\phi_u(t)$ and
$\phi_v(t)$, respectively. Since v cannot complete the pausing feature of $\phi_v(t)$ while $M_u(t - 2)$ is still
stored in $\psi_v(u)$, it follows that at time $\tau_v$, port $\psi_v(u)$ stores the message $M_u(t')$ for some $t' \geq t - 1$.
Our goal in the remainder of this proof is to show that $t' \leq t$. If $\tau_v < \tau_u$, then $t'$ must be exactly
$t - 1$, which concludes the inductive step for that case.

So, assume that $\tau_v \geq \tau_u$ and suppose by contradiction that $t' \geq t + 1$. Using the same line of
arguments as in the previous paragraph, we conclude that at time $\tau_u$, port $\psi_u(v)$ stores the message
$M_v(t - 1)$. Node u cannot complete the pausing feature of $\phi_u(t + 1)$ while $M_v(t - 1)$ is still stored
in $\psi_u(v)$, hence v must have transmitted $M_v(t)$ before u completed the pausing feature of $\phi_u(t + 1)$.
But this means that v completed the pausing feature of $\phi_v(t)$ before u could have transmitted
$M_u(t + 1)$, in contradiction to the assumption that $\psi_v(u)$ stores $M_u(t')$ for some $t' \geq t + 1$ at time
$\tau_v$. The assertion follows. □

Consider two adjacent nodes $u, v \in V$. If node u is at round $t - 1$ when an adjacent node v is
at round $t + 1$, then v completed the pausing feature of $\phi_v(t)$ before u transmitted $M_u(t - 1)$, in
contradiction to Lemma 3.2. Therefore, our synchronizer satisfies synchronization property (S1).
Furthermore, a similar argument shows that between the time v completed the pausing feature of
$\phi_v(t)$ and the time v completed the simulation phase $\phi_v(t)$ itself, the content of $\psi_v(u)$ may change
from $M_u(t - 1)$ to $M_u(t)$ (if it was not already $M_u(t)$), but it will not store $M_u(t')$ for any $t' > t$.
This fact is crucial for the implementation of the simulation feature.

The Simulation Feature. Upon completion of the pausing feature $P_q \times \{j\}$, v moves on to the
simulation feature $S_q \times \{j\}$. The purpose of this feature is to perform the actual simulation of
round t in v, namely, to determine the state (of Q) dominating the simulation phase of the next
round and the message transmitted when moving from the simulation phase of the current round
to that of the next round.

To see how this works out, suppose that $\lambda(q) = \sigma \in \Sigma$. We would have wanted node v to
count (up to the bounding parameter b) the number of occurrences of $\hat{\Sigma}$-letters in its ports that
correspond to the transmission of $\sigma$ at round $t - 1$ under $\Pi$, that is, the number of occurrences of
letters in $\Gamma_{t-1} \cup \Gamma_t$, where

$$\Gamma_{t-1} = \{ (\sigma', \sigma, j - 1) \mid \sigma' \in \Sigma \cup \{\varepsilon\} \} \quad \text{and} \quad \Gamma_t = \{ (\sigma, \sigma', j) \mid \sigma' \in \Sigma \cup \{\varepsilon\} \} .$$

More formally, the application of the transition function $\delta$ at the end of the simulation phase $\phi_v(t)$
should be based on $f_b\left(\sum_{\gamma \in \Gamma_{t-1} \cup \Gamma_t} \sharp(\gamma)\right)$, where $\sharp(\gamma)$ stands for the number of occurrences of the
letter $\gamma$ in the ports of v at the end of $\phi_v(t)$.

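Identifying the integer b with the symbol ${\geq}b$, we observe that the function $f_b : \mathbb{Z}_{\geq 0} \to B$ satisfies

$$f_b(x + y) = \min\{ f_b(x) + f_b(y), b \}$$

for every $x, y \in \mathbb{Z}_{\geq 0}$. A natural attempt to compute $f_b\left(\sum_{\gamma \in \Gamma_{t-1} \cup \Gamma_t} \sharp(\gamma)\right)$ would include in the
feature $S_q \times \{j\}$ a state $s_{\gamma,i}$ for every letter $\gamma \in \Gamma_{t-1} \cup \Gamma_t$ and integer $i \in \{0, \ldots, b\}$; the query letter
of $s_{\gamma,i}$ would be $\hat{\lambda}(s_{\gamma,i}) = \gamma$ and the transition function $\hat{\delta}$ would be designed so that v moves from $s_{\gamma,i}$
to $s_{\gamma',i'}$, where $\gamma'$ follows $\gamma$ in some fixed order of the letters in $\Gamma_{t-1} \cup \Gamma_t$ and $i' = \min\{ i + f_b(\sharp(\gamma)), b \}$.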

However, care must be taken with this approach since $\sharp(\gamma)$ may decrease (respectively, increase)
during $\phi_v(t)$ for $\gamma \in \Gamma_{t-1}$ (resp., for $\gamma \in \Gamma_t$) due to new incoming messages. To avoid this obstacle,
we design the feature $S_q \times \{j\}$ so that first, it computes $\varphi_1 \leftarrow f_b\big(\sum_{\gamma \in \Gamma_{t-1}} \sharp(\gamma)\big)$; next, it computes
$\varphi_2 \leftarrow f_b\big(\sum_{\gamma \in \Gamma_t} \sharp(\gamma)\big)$; and finally, it computes "again" $\varphi_3 \leftarrow f_b\big(\sum_{\gamma \in \Gamma_{t-1}} \sharp(\gamma)\big)$. If $\varphi_1 = \varphi_3$, then the
current simulation phase is over and $\hat{\delta}$ is applied, simulating $\delta(q, f_b(\varphi_1 + \varphi_2))$; otherwise, the feature
$S_q \times \{j\}$ is invoked from scratch. Since the value of $f_b\big(\sum_{\gamma \in \Gamma_{t-1}} \sharp(\gamma)\big)$ cannot increase during the
simulation phase, and since $\varphi_1 \le b$, the feature $S_q \times \{j\}$ is invoked at most $b$ times throughout the
execution of the simulation phase. By induction on $t$, we conclude that our synchronizer satisfies
synchronization property (S2), which concludes the correctness proof of the simulation.
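
The read, read, re-read logic of the simulation feature can be sketched as follows. This is only an illustration under stated assumptions: the callbacks `read_old` and `read_new` and the toy `Ports` class are hypothetical names that stand in for querying the ports on the letters of $\Gamma_{t-1}$ and $\Gamma_t$, and the Python loop replaces the constant-size state machine of the actual construction.

```python
def f_b(x, b):
    """One-two-many counting: the count x truncated at the bounding parameter b."""
    return min(x, b)


def simulation_feature(read_old, read_new, b):
    """Return f_b(#old + #new) although letters may migrate from old to new.

    read_old() / read_new() are hypothetical callbacks returning the current
    number of port letters in Gamma_{t-1} / Gamma_t, respectively.
    """
    while True:
        phi1 = f_b(read_old(), b)
        phi2 = f_b(read_new(), b)
        phi3 = f_b(read_old(), b)   # count the old letters "again"
        if phi1 == phi3:            # the old count did not change in between
            return f_b(phi1 + phi2, b)
        # otherwise the feature is invoked from scratch


class Ports:
    """Toy port model (an assumption of this sketch): after each read, some
    old letters may be overwritten by new ones."""

    def __init__(self, old, arrivals):
        self.old, self.new = old, 0
        self.arrivals = list(arrivals)  # how many letters migrate after each read

    def _tick(self):
        k = min(self.arrivals.pop(0) if self.arrivals else 0, self.old)
        self.old -= k
        self.new += k

    def read_old(self):
        x = self.old
        self._tick()
        return x

    def read_new(self):
        x = self.new
        self._tick()
        return x
```

For instance, with two relevant ports, $b = 3$, and one letter migrating right after the first read, a naive "count old, then count new" pass would report 3, whereas `simulation_feature` detects the change, restarts, and reports 2.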

Accounting. It remains to show that all ingredients of protocol $\hat{\Pi}$ are of constant size and that the
run-time of protocol $\hat{\Pi}$ incurs at most a constant multiplicative overhead on top of that of protocol
$\Pi$. The former claim is established by following our synchronizer construction, observing that
$|\hat{\Sigma}| = O(|\Sigma|^2)$ and $|\hat{Q}| = O(|Q| \cdot (|\Sigma|^2 + |\Sigma| \cdot b))$ (recall that the bounding parameter $b$ remains
unchanged). For the latter claim, we need the following definition: given some node subset $U \subseteq V$
and round $t \in \mathbb{Z}_{>0}$, let $\tau(U, t)$ denote the first time at which $u$ completed simulation phase $\phi_u(t)$
for all nodes $u \in U$. The following proposition can now be established.

Proposition 3.3. For every node $v \in V$ and round $t \in \mathbb{Z}_{>0}$, the time difference $\tau(\{v\}, t + 1) -
\tau(N(v) \cup \{v\}, t)$ is bounded from above by a constant.

Proof. Since each transmitted message has a delay of at most 1 unit of time, it follows that by
time $\tau(N(v) \cup \{v\}, t) + 1$, message $M_u(t)$ must reach $\psi_v(u)$ for all $u \in N(v)$. The pausing and
simulation features of $\phi_v(t + 1)$ are then completed within $O(|\Sigma|^2)$ and $O(|\Sigma| \cdot b)$ steps, respectively.
The assertion follows as each step lasts for at most 1 unit of time. □

Employing Proposition 3.3, we conclude by induction on $t$ that $\tau(V, t) = O(t)$ for every $t \in \mathbb{Z}_{>0}$,
hence if the execution of protocol $\Pi$ requires $T$ rounds, then the execution of protocol $\hat{\Pi}$ is completed
within $O(T)$ time units. Theorem 3.1 follows.

3.2 Multiple-Letter Queries 

Recall that according to the model presented in Section 2, each state $q \in Q$ is associated with a
query letter $\lambda(q)$ and the application of the transition function when node $v$ resides in state $q$ is
determined by $f_b(\sharp(\sigma))$, where $\sigma = \lambda(q)$ and $\sharp(\sigma)$ is the number of occurrences of the letter $\sigma$ in the ports of
$v$. From the perspective of the protocol designer, it is often more convenient to assume that the
node queries on all letters simultaneously, namely, that the application of the transition function
is determined by the vector $\langle f_b(\sharp(\sigma)) \rangle_{\sigma \in \Sigma}$.

Now that we may assume a synchronous environment, this stronger multiple-letter queries
assumption can easily be supported. Indeed, at the cost of increasing the number of states and the
run-time by constant factors, one can subdivide each round into $|\Sigma|$ subrounds, dedicating each
subround to a different letter in $\Sigma$, so that at the end of the round, the state of $v$ reflects $f_b(\sharp(\sigma))$
for every $\sigma \in \Sigma$.

Theorem 3.4. Every nFSM protocol with multiple-letter queries can be simulated by an nFSM 
protocol with single-letter queries at the cost of a constant multiplicative run-time overhead. 
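
The subround idea can be illustrated by a small sketch. The function names below are hypothetical and the ports are modeled as a plain list of letters; the point is only that one single-letter query per subround suffices to assemble the whole vector.

```python
from collections import Counter


def single_letter_query(ports, letter, b):
    """f_b of the number of occurrences of one letter in the ports."""
    return min(Counter(ports)[letter], b)


def multi_letter_query(ports, alphabet, b):
    """Assemble the vector <f_b(#sigma)> using one subround per letter."""
    return tuple(single_letter_query(ports, sigma, b) for sigma in alphabet)
```

Since $|\Sigma|$ and $b$ are constants, the vector takes one of at most $(b+1)^{|\Sigma|}$ values, so a node can remember it in its state.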



4 Maximal Independent Set 

Given a graph $G = (V, E)$, the maximal independent set (MIS) problem asks for a node subset
$U \subseteq V$ which is independent in the sense that $(U \times U) \cap E = \emptyset$, and maximal in the sense that
$U' \subseteq V$ is not independent for every $U' \supset U$. Distributed MIS algorithms with logarithmic run-time
operating in the message passing model were presented by Luby [27] and independently, by Alon
et al. [3].* Luby's algorithm has since become a specimen of distributed algorithms; in the last 25
years, researchers have tried to improve it, if only e.g., with an improved bit complexity [29], on
special graph classes [SIES], or in a weaker communication model [1]. An $\Omega(\sqrt{\log n})$-lower bound
on the run-time of any distributed MIS algorithm operating in the message passing model was
established by Kuhn et al. [23]. Our goal in this section is to design an nFSM protocol for the MIS
problem with run-time $O(\log^2 n)$.

* The focus of [27] and [3] was actually on the PRAM model, but their algorithms can be adapted to the message
passing model.

Outline of the Key Technical Ideas. Our protocol is inspired by the existing message passing
MIS algorithms. Common to all these algorithms is that they are based on the concept of grouping
consecutive rounds into phases, where in each phase, nodes compete against their neighbors over
the right to join the MIS. Existing implementations of such competitions require at least one of
the following three capabilities: (1) performing calculations that involve super-constant numbers;
(2) communicating with each neighbor independently; or (3) sending messages of super-constant
size, specifically, of size $c \log n$ for some constant $c > 0$. The first two capabilities are clearly out of
the question for an nFSM protocol. The third one is also not supported by the nFSM model, but
perhaps one can divide a message with a logarithmic number of bits over logarithmically many rounds,
sending 1 (or $O(1)$) bits per round (cf. Algorithm B in [29])?

This naive attempt results in phases of length $c \log n$. However, no FSM can count the rounds
in a $c \log n$ long phase, a task essential for deciding if the current phase is over and the next one
should begin. Furthermore, to guarantee fair competition, the phases must be aligned across the
network, thus ruling out the possibility to start node $v$'s phase $i$ before phase $i - 1$ of some node
$u \in N(v)$ is finished. In fact, an efficient algorithm that requires $\omega(1)$ long aligned phases cannot
be implemented under the nFSM model. So, how can we decide if node $v$ joins the MIS using
constant size messages without the ability to maintain long aligned phases?

This issue is resolved by relaxing the requirements that the phases are aligned and of a
predetermined length, introducing a feature referred to as a tournament. Our tournaments are only
"softly" aligned and their lengths are determined probabilistically, in a manner that can be
maintained under the nFSM model. Nevertheless, they enable a fair competition between neighboring
nodes, as desired.

The Protocol. Employing Theorems 3.1 and 3.4, we assume a locally synchronous
environment and use multiple-letter queries. The state set of the protocol is $Q =
\{$WIN, LOSE, DOWN1, DOWN2, UP0, UP1, UP2$\}$, with $Q_I = \{$DOWN1$\}$ (the initial state of all nodes) and
$Q_O = \{$WIN, LOSE$\}$, where WIN (respectively, LOSE) indicates membership (resp., non-membership)
in the MIS output by the protocol. The states in $Q_A = Q - Q_O$ are called the active states and
a node in an active state is referred to as an active node. We take the communication alphabet
$\Sigma$ to be identical to the state set $Q$, where the letter transmissions are designed so that node $v$
transmits letter $q$ whenever it moves to state $q$ from some state $q' \neq q$; no letter is transmitted in
a round at which $v$ remains in the same state. Letter DOWN1 is the initial letter stored in all ports
at the beginning of the execution. The bounding parameter is set to $b = 1$.

A schematic description of the transition function is provided in Figure 1; its logic is as follows.
Each state $q \in Q_A$ has a subset $D(q) \subseteq Q_A$ of delaying states: node $v$ remains in the current
state $q$ as long as (at least) one of its neighbors is in some state in $D(q)$. This is implemented by
querying on the letters (corresponding to the states) in $D(q)$, staying in state $q$ as long as at least
one of these letters is found in the ports. Specifically, state DOWN1 is delayed by state DOWN2, which
is delayed by all three UP states. State UP$_j$, $j = 0, 1, 2$, is delayed by state UP$_{j-1 \bmod 3}$, where state
UP0 is also delayed by state DOWN1.

Figure 1: The transition function of the MIS protocol with state names abbreviated by their first
(capital) letters. The node stays in state $q$ (a.k.a. delayed) as long as $\sharp(q') > 0$ for any state $q'$
such that a $q' \to q$ transition is defined (for clarity, this is omitted from the figure). Assuming that
the node is not delayed, each transition specified in the figure is associated with a condition on the
number of appearances of the query letters in the ports (depicted by the corresponding lower-case
letter) so that the transition is followed only if the condition is satisfied (an empty condition is
satisfied by all port configurations); if some port configuration satisfies several transition conditions,
then one of them is chosen uniformly at random.

States WIN and LOSE are sinks in the sense that a node that moves to one of these states will
stay there indefinitely. Assuming that node $v$ does not find any delaying letter in its ports, the
logic of the UP and DOWN states is as follows. From state DOWN1, $v$ moves to state UP0. From state
DOWN2, $v$ moves to state DOWN1 if $\sharp($WIN$) = 0$, that is, if it does not find any WIN letter in its ports;
otherwise, it moves to state LOSE. When in state UP$_j$, $v$ tosses a fair coin and proceeds as follows: if
the coin turns head, then $v$ moves to state UP$_{j+1 \bmod 3}$; if the coin turns tail, then $v$ moves to state
WIN if $\sharp($UP$_j) = \sharp($UP$_{j+1 \bmod 3}) = 0$, and to state DOWN2 otherwise. This completes the description of
our nFSM protocol for the MIS problem.
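
The transition logic described above is compact enough to be written down directly. The following is a minimal sketch, assuming a fully synchronous execution in which the port of $v$ for neighbor $u$ always shows the current state of $u$ (which is what the "transmit on state change" rule yields); the function names `step` and `run_mis` and the adjacency-dictionary input format are illustrative choices, not part of the protocol.

```python
import random

UP = ("UP0", "UP1", "UP2")

# Delaying states D(q) of every active state q, as listed in the text.
DELAYED_BY = {
    "DOWN1": {"DOWN2"},
    "DOWN2": set(UP),
    "UP0": {"UP2", "DOWN1"},
    "UP1": {"UP0"},
    "UP2": {"UP1"},
}


def step(state, letters, heads):
    """Next state of a node, given the set of letters found in its ports.

    With bounding parameter b = 1 the node only learns which letters occur.
    """
    if state in ("WIN", "LOSE"):
        return state                       # sinks
    if DELAYED_BY[state] & letters:
        return state                       # delayed by some neighbor
    if state == "DOWN1":
        return "UP0"
    if state == "DOWN2":
        return "LOSE" if "WIN" in letters else "DOWN1"
    j = UP.index(state)
    nxt = UP[(j + 1) % 3]
    if heads:
        return nxt
    return "DOWN2" if {state, nxt} & letters else "WIN"


def run_mis(adj, rng, max_rounds=100_000):
    """Run synchronous rounds until every node is in an output state."""
    state = {v: "DOWN1" for v in adj}
    for _ in range(max_rounds):
        if all(s in ("WIN", "LOSE") for s in state.values()):
            break
        state = {v: step(state[v], {state[u] for u in adj[v]}, rng.random() < 0.5)
                 for v in adj}
    return state
```

Calling `run_mis(adj, random.Random(seed))` on a small graph and collecting the nodes that end in state WIN gives a set that can be checked for independence and maximality.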

Turns and Tournaments. Our protocol is designed so that an active node $v$ traverses the DOWN
and UP states in a (double-)circular fashion: an inner loop of the UP states (moving from state UP$_j$
to state UP$_{j+1 \bmod 3}$) nested within an outer loop consisting of the DOWN states and the inner loop.
Of course, $v$ may spend more than one round at each state $q \in Q_A$ (delayed by adjacent nodes in
states $D(q)$); we refer to a maximal contiguous sequence of rounds that $v$ spends in the same state
$q \in Q_A$ as a $q$-turn, or simply as a turn if the actual state $q$ is irrelevant. A maximal contiguous
sequence of turns that starts at a DOWN1-turn and does not include any other DOWN1-turn (i.e., a
single iteration of the outer loop) is referred to as a tournament. We index the tournaments and
the turns within a tournament by the positive integers. Note that by definition, every tournament
$i$ of $v$ starts with a DOWN1-turn, followed by a non-empty sequence of UP-turns. If tournament $i + 1$
of $v$ exists, then tournament $i$ ends with a DOWN2-turn; otherwise, it ends with an UP-turn. The
following observation is established by induction on the rounds.

Observation 4.1. Consider some node $v \in V$ in turn $j \in \mathbb{Z}_{>0}$ of tournament $i \in \mathbb{Z}_{>0}$ and some
active node $u \in N(v)$.

• If this is a DOWN1-turn of $v$ ($j = 1$), then $u$ is in either (A) the last (DOWN2-)turn of tournament
$i - 1$; (B) turn 1 of tournament $i$; or (C) turn 2 of tournament $i$.

• If this is an UP-turn of $v$ ($j \ge 2$), then $u$ is in either (A) turn $j - 1$ of tournament $i$; (B) turn
$j$ of tournament $i$; (C) turn $j + 1$ of tournament $i$; or (D) the last (DOWN2-)turn $j' \le j + 1$ of
tournament $i$.

• If this is a DOWN2-turn of $v$ (the last turn of this tournament), then $u$ is in either (A) an
UP-turn $j' \ge j - 1$ of tournament $i$; (B) the last (DOWN2-)turn of tournament $i$; or (C) turn 1
of tournament $i + 1$.

Given some $U \subseteq V$ and $i, j \in \mathbb{Z}_{>0}$, let $T_U(i, j)$ denote the first time at which every node $v \in U$
satisfies either

(1) $v$ is inactive;

(2) $v$ is in tournament $i' > i$;

(3) $v$ is in the last (DOWN2-)turn of tournament $i$; or

(4) $v$ is in turn $j' \ge j$ of tournament $i$.

Employing Observation 4.1, the delaying states feature guarantees that

$$T_{\{v\}}(i, j + 1) \le T_{N(v) \cup \{v\}}(i, j) + 1 \qquad (1)$$

for every $v \in V$ and $i, j \in \mathbb{Z}_{>0}$. Since $T_U(i, j) \le T_V(i, j)$ for every $U \subseteq V$, we can apply
inequality (1) to each node $v \in V$, concluding that

$$T_V(i, j + 1) \le T_V(i, j) + 1 ,$$

which immediately implies that

$$T_V(i, k + 1) \le T_V(i, 1) + k . \qquad (2)$$

Geometric Random Variables. Consider some $v \in V$ and $i \in \mathbb{Z}_{>0}$. Assuming that tournament
$i$ of $v$ exists, let $X_v(i)$ denote its length in terms of number of turns. For the sake of simplifying the
analysis, if tournament $i$ is the last tournament of $v$, then we actually take $X_v(i)$ to be its length plus
1 (this is done in order to compensate for the missing DOWN2-turn in the end of the tournament). The
logic of the UP states implies that $X_v(i)$ is a random variable that obeys distribution Geom(1/2) + 2,
namely, a fixed term of 2 plus the geometric distribution with parameter 1/2, independently of
$X_{v'}(i')$ for any $v' \neq v$ and/or $i' \neq i$. Since the maximum of $n$ independent Geom(1/2)-random
variables is $O(\log n)$ with high probability, inequality (2) yields the following observation.

Observation 4.2. For every $i \in \mathbb{Z}_{>0}$, $T_V(i, 1)$ is finite with probability 1 and

$$T_V(i + 1, 1) \le T_V(i, 1) + O(\log n)$$

with high probability.
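
The claim about the maximum of geometric random variables follows from a union bound. A short derivation, where $G_1, \ldots, G_n$ are names introduced here for the independent Geom(1/2) variables (supported on the positive integers), $k$ is a positive integer, and $c > 0$ is an arbitrary constant, could read:

```latex
\Pr\Big[\max_{1 \le \ell \le n} G_\ell > k\Big]
  \;\le\; \sum_{\ell=1}^{n} \Pr[G_\ell > k]
  \;=\; n \cdot 2^{-k},
\qquad\text{hence for } k = \lceil (c+1) \log_2 n \rceil:\qquad
\Pr\Big[\max_{1 \le \ell \le n} G_\ell > k\Big] \;\le\; n \cdot n^{-(c+1)} \;=\; n^{-c}.
```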

Our protocol is designed so that node $v$ moves to an output state (WIN or LOSE) in the end
of each tournament with positive probability. Moreover, the logic of state DOWN2 guarantees that
if node $v$ moves to state WIN in the end of tournament $i$, then all its active neighbors move to
state LOSE in the end of their respective tournaments $i$. By Observation 4.2, we conclude that our
protocol reaches an output configuration with probability 1 and that every output configuration
reflects an MIS. It remains to bound the run-time of our protocol.

The Virtual Graph $G^i$. Let $V^i$ be the set of nodes for which tournament $i$ exists and let
$G^i = (V^i, E^i)$ be the subgraph induced on $G$ by $V^i$, where $E^i = E \cap (V^i \times V^i)$.* Given some
node $v \in V^i$, let $N^i(v) = \{u \in V^i \mid (u, v) \in E\}$ be the neighborhood of node $v$ in $G^i$ and let
$d^i(v) = |N^i(v)|$ be its degree. Note that the graph $G^i$ is virtual and defined solely for the sake
of the analysis; in particular, we do not assume that there exists some time at which the graph
induced by any meaningful subset of the nodes (say, the nodes in tournament $i$) agrees with $G^i$.
The key observation in this context is that conditioned on $G^i$, the random variables $X_v(i)$, $v \in V^i$,
are (still) independent and obey distribution Geom(1/2) + 2. Moreover, the graph $G^{i+1}$ is fully
determined by the random variables $X_v(i)$, $v \in V^i$. Our analysis relies on the following lemma.

Lemma 4.3. There exist two constants $0 < p, c < 1$ such that $|E^{i+1}| \le c|E^i|$ with probability at
least $p$.

We will soon turn to proving Lemma 4.3, but first, let us explain why it suffices for the
completion of our analysis. Define the random variable $Y = \min\{i \in \mathbb{Z}_{>0} : |E^i| = 0\}$. Lemma 4.3 implies
that $Y$ is stochastically dominated by a random variable that obeys distribution NB$(O(\log n), 1 - p)$
$+ O(\log n)$, namely, a fixed term of $O(\log n)$ plus the negative binomial distribution with
parameters $O(\log n)$ and $1 - p$, hence $Y = O(\log n)$ in expectation and with high probability. Since the
nodes in $V - V^i$ are all in an output state (and will remain in that state), and since the logic of
the UP states implies that a degree-0 node in $G^i$ will move to state WIN in the end of tournament
$i$ (with probability 1) and thus, will not be included in $V^{i+1}$, we can employ Observation 4.2 to
conclude that the run-time of our protocol is $O(\log^2 n)$.

* The notation $G^i$ used in this section should not be confused with the $i$-th power of $G$.

The remainder of this section is dedicated to establishing Lemma 4.3. The proof technique we
use for that purpose resembles (a hybrid of) the techniques used in [3] and [29] for the analysis of
their MIS algorithms. We say that node $v \in V^i$ is good in $G^i$ if

$$|\{u \in N^i(v) \mid d^i(u) \le d^i(v)\}| \ge d^i(v)/3 ,$$

i.e., if at least a third of $v$'s neighbors in $G^i$ have degrees smaller than or equal to that of $v$. The following
lemma is established in [3].

Lemma 4.4 ([3]). More than half of the edges in $G^i$ are incident on good nodes in $G^i$.
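
To make the definition of a good node concrete, the following sketch computes the good nodes of a small graph and the fraction of edges incident on them. The function names and the adjacency-dictionary format are assumptions of this illustration; it checks the definition on examples and is not a proof of the lemma.

```python
def good_nodes(adj):
    """Nodes v with at least d(v)/3 neighbors of degree at most d(v)."""
    deg = {v: len(adj[v]) for v in adj}
    return {v for v in adj
            if 3 * sum(deg[u] <= deg[v] for u in adj[v]) >= deg[v]}


def fraction_of_edges_on_good_nodes(adj):
    """Fraction of edges with at least one good endpoint (graph must have an edge)."""
    good = good_nodes(adj)
    edges = {frozenset((u, v)) for v in adj for u in adj[v]}
    return sum(1 for e in edges if e & good) / len(edges)
```

On a star with four leaves, only the center is good (each leaf has a single neighbor, of larger degree), yet every edge is incident on the center, so the fraction is 1.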

Disjoint Winning Events. Consider some good node $v$ in $G^i$ with $d = d^i(v) > 0$ and let $\hat{N}^i(v) =
\{u \in N^i(v) \mid d^i(u) \le d\}$. Recall that the definition of a good node implies that $|\hat{N}^i(v)| \ge d/3$. We
say that node $u \in \hat{N}^i(v)$ wins $v$ in tournament $i$ if

$$X_u(i) > \max\{X_w(i) \mid w \in N^i(u) \cup N^i(v) - \{u\}\}$$

and denote this event by $A^i(u, v)$. The main observation now is that if $u$ wins $v$ in tournament $i$,
then in the end of their respective tournaments $i$, $u$ moves to state WIN and $v$ moves to state LOSE.
Moreover, the events $A^i(u, v)$ and $A^i(w, v)$ are disjoint for every $u, w \in \hat{N}^i(v)$, $u \neq w$.

Let $u_1, \ldots, u_k$ be the nodes in $N^i(u) \cup N^i(v)$, where $0 < k \le 2d$ by the definition of a good
node. Let $B^i(u, v)$ denote the event that the maximum of $\{X_{u_\ell}(i) \mid 1 \le \ell \le k\}$ is attained at a
single $1 \le \ell \le k$. Since $X_{u_1}(i), \ldots, X_{u_k}(i)$ are independent random variables that obey distribution
Geom(1/2) + 2, it follows that $\mathbb{P}(B^i(u, v)) \ge 2/3$. Therefore,

$$\mathbb{P}(A^i(u, v)) = \mathbb{P}(A^i(u, v) \mid B^i(u, v)) \cdot \mathbb{P}(B^i(u, v)) \ge \frac{1}{2d} \cdot \frac{2}{3} ,$$

which implies that

$$\mathbb{P}(v \notin V^{i+1} \mid v \text{ is good in } G^i) \ge \mathbb{P}\Big(\bigvee_{u \in \hat{N}^i(v)} A^i(u, v)\Big) = \sum_{u \in \hat{N}^i(v)} \mathbb{P}(A^i(u, v)) \ge \frac{d}{3} \cdot \frac{1}{2d} \cdot \frac{2}{3} = \frac{1}{9} .$$

Combined with Lemma 4.4, and since every edge incident on a good node is removed with probability
at least 1/9, we conclude that $\mathbb{E}[|E^{i+1}|] < \frac{17}{18} |E^i|$. Lemma 4.3 follows by Markov's
bound.

Theorem 4.5. There exists an nFSM protocol that computes an MIS in any n-node graph with
run-time $O(\log^2 n)$.

5 Coloring a Tree with 3 Colors 

Given a graph $G = (V, E)$, the coloring problem asks for an assignment of colors to the nodes such
that no two neighboring nodes have the same color. A coloring using at most $k$ colors is called a
$k$-coloring. The smallest number of colors needed to color graph $G$ is called its chromatic number,
denoted by $\chi(G)$. In general, $\chi(G)$ is difficult to compute even in a centralized model [10]. As such,
the distributed computing community is generally satisfied already with a $(\Delta + 1)$-, $O(\Delta)$-, or even
$\Delta^{O(1)}$-coloring, where $\Delta = \Delta(G)$ is the largest degree in the graph $G$, with possibly $\Delta(G) \gg \chi(G)$
[T5l [321 [T9l [26t [371 El 1221 [HI [3 |35]. However, even for relatively simple graph classes, $\Delta$ may grow
with $n$. As the output of each node under the nFSM model is taken from a constant size set, we
must and will tackle a graph class that features a small chromatic number: trees.

Any tree $T$ has a chromatic number $\chi(T) = 2$. Unfortunately, it is easy to show that in general,
the task of 2-coloring trees requires run-time proportional to the diameter of the tree even under the
message passing model, and hence cannot be achieved by an efficient distributed algorithm. The
situation improves dramatically once 3 colors are allowed; indeed, Cole and Vishkin [15] presented
a distributed algorithm that 3-colors directed paths, and in fact, any directed tree (directed in the
sense that each node knows the port leading to its unique parent), in time $O(\log^* n)$. Linial [26]
showed that this is asymptotically optimal.

Since it is not clear how to represent directed trees in the nFSM model, we focus on undirected
trees, designing an nFSM protocol that 3-colors any n-node (undirected) tree in run-time $O(\log n)$.
A lower bound result of Kothapalli et al. [21] shows that this cannot be improved (asymptotically)
even by a message passing algorithm as long as the size of each message is $O(1)$.

Employing Theorems 3.1 and 3.4, we assume a locally synchronous environment and use
multiple-letter queries. The description of the protocol will not delve into the level of defining
the states and transition function (as we did in Section 4 for the MIS protocol), but the reader will
be easily convinced that this protocol can indeed be implemented under the nFSM model.

The Modes. At all times, each node v is in one of the following three modes. 

(1) Mode COLORED: the color of $v$ is determined ($v$ is in an output state) and it no longer takes an
active part in the protocol.

(2) Mode ACTIVE: the color of v has not been determined yet and v takes an active part in the 
protocol. 

(3) Mode WAITING: the color of v has not been determined yet and v is waiting for one of its 
neighbors to be colored before it resumes taking an active part in the protocol (going back to mode 
ACTIVE). 

Initially, all nodes are in mode ACTIVE. When an ACTIVE node moves to mode COLORED,
assigned with color $c \in \{1, 2, 3\}$, it transmits a 'my color is c' message and it does not transmit any
more messages; when an ACTIVE node moves to mode WAITING, it transmits an 'I am WAITING'
message and it does not transmit any more messages until it returns to mode ACTIVE, in which
case it transmits an 'I am ACTIVE' message. Therefore, the message stored in the port of node $v$
corresponding to neighbor $u$ of $v$ always indicates (perhaps among other things) the current mode
of $u$.

The Phases. The execution of the protocol is divided into phases indexed by the positive integers,
where each phase consists of 4 rounds. Consider some phase $i \in \mathbb{Z}_{>0}$. Let $V^i$ be the set of ACTIVE
nodes at the beginning of phase $i$ and let $F^i$ be the forest induced on $T$ by $V^i$ ($F^i$ may contain one
or more trees), referred to as the ACTIVE forest. Given some node $v \in V^i$, let $N^i(v) = \{u \in V^i \mid
(u, v) \in E\}$ be the neighborhood of $v$ in $F^i$ and let $d^i(v) = |N^i(v)|$ be its degree.

The structure of the phases is as follows. Consider some node $v \in V^i$. In round 1 of the
phase, $v$ transmits an 'I am ACTIVE' message. Setting the bounding parameter of the protocol to
$b = 3$, we conclude that in round 2, $v$ can distinguish between the cases $d^i(v) = 0$, $d^i(v) = 1$,
$d^i(v) = 2$, and $d^i(v) \ge 3$ simply by querying its ports for 'I am ACTIVE' messages; in other words,
$v$ "knows" $f_3(d^i(v))$, i.e., its degree calculated with respect to the one-two-many principle with
bounding parameter $b = 3$. Employing this "knowledge", $v$ transmits $f_3(d^i(v))$ in round 2 of phase
$i$, so in round 3, the port of $v$ corresponding to $u$ stores a message indicating $f_3(d^i(u))$ for every
node $u \in N^i(v)$.

Rounds 3 and 4 of phase $i$ are dedicated to Procedure RandColor that we will describe soon.
Whether or not $v$ runs Procedure RandColor depends on the degree of $v$ and on the degrees of its
ACTIVE neighbors. Specifically, $v$ runs Procedure RandColor if: (1) $d^i(v) = 0$; (2) $d^i(v) = 1$ with
$N^i(v) = \{u\}$ and $d^i(u) = 1$; or (3) $d^i(v) = 2$ with $N^i(v) = \{u_1, u_2\}$ and $d^i(u_1), d^i(u_2) \le 2$. In
contrast, if $d^i(v) = 1$ with $N^i(v) = \{u\}$ and $d^i(u) \ge 2$, then $v$ moves to mode WAITING without
running Procedure RandColor, in which case we say (just for the sake of the analysis) that $v$ waits
on $u$. Otherwise ($d^i(v) \ge 3$ or $d^i(v) = 2$ with some neighbor $u \in N^i(v)$ such that $d^i(u) \ge 3$), $v$
remains in mode ACTIVE without running Procedure RandColor.

As stated beforehand, the COLORED nodes do not take an active part in the protocol. A WAITING
node $v$ moves to mode ACTIVE in the end of phase $i$ if some neighbor $u$ of $v$, $u \in V^i$, moves to mode
COLORED during phase $i$ ($v$ spots this event by querying on 'my color is c' messages).

Procedure RandColor. Responsible for the actual color assignments, Procedure RandColor takes
2 rounds (rounds 3 and 4 of some phase). Only an ACTIVE node may run the procedure, and when
the procedure is over, the node either stays in mode ACTIVE or moves to mode COLORED. Consider
some node $v$ running the procedure and let $C(v) \subseteq \{1, 2, 3\}$ be the subset of colors which are not
yet assigned to the neighbors of $v$ in $T$. (Our analysis shows that if $v$ is ACTIVE, then $C(v) \neq \emptyset$.)
As every COLORED node transmits a message indicating its color, $v$ can determine $C(v)$ by querying
its ports.

In the first round of Procedure RandColor, $v$ picks some color $c \in C(v)$ uniformly at random
and transmits a 'proposing color c' message. In the second round of the procedure, if $v$ finds a
'proposing color c' message (with the same $c$) in its ports, then it remains in mode ACTIVE. Otherwise (no
neighbor of $v$ competes with $v$ over color $c$), it moves to mode COLORED and transmits a 'my color
is c' message. This completes the description of our protocol.
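
The whole protocol can be simulated at the granularity of phases. The sketch below is an illustration under stated assumptions: the four rounds of a phase are collapsed into one loop iteration, the function name `color_tree` and the adjacency-dictionary input are choices made here, and the code relies on the non-emptiness of $C(v)$ that the analysis establishes for the nodes running Procedure RandColor.

```python
import random


def color_tree(adj, rng, max_phases=10_000):
    """Phase-level simulation on a tree given as {node: [neighbors]}."""
    mode = {v: "ACTIVE" for v in adj}
    color = {}
    for _ in range(max_phases):
        if len(color) == len(adj):
            break
        active = {v for v in adj if mode[v] == "ACTIVE"}
        # Rounds 1-2: each ACTIVE node learns f_3 of its degree in the ACTIVE forest.
        deg = {v: min(sum(u in active for u in adj[v]), 3) for v in active}
        proposal = {}
        for v in active:
            nbrs = [u for u in adj[v] if u in active]
            runs = (deg[v] == 0
                    or (deg[v] == 1 and deg[nbrs[0]] == 1)
                    or (deg[v] == 2 and all(deg[u] <= 2 for u in nbrs)))
            if runs:
                # Round 3: propose a color not yet used by a COLORED neighbor.
                free = [c for c in (1, 2, 3)
                        if all(color.get(u) != c for u in adj[v])]
                proposal[v] = rng.choice(free)
            elif deg[v] == 1:
                mode[v] = "WAITING"   # v waits on its unique ACTIVE neighbor
        # Round 4: a proposal succeeds if no neighbor proposed the same color.
        winners = {v: c for v, c in proposal.items()
                   if all(proposal.get(u) != c for u in adj[v])}
        for v, c in winners.items():
            mode[v], color[v] = "COLORED", c
        # A WAITING node wakes up once some neighbor was colored in this phase.
        for v in adj:
            if mode[v] == "WAITING" and any(u in winners for u in adj[v]):
                mode[v] = "ACTIVE"
    return color
```

Note that the decision rules only compare truncated degrees with 0, 1, 2, and "at least 3", which is exactly the information available with bounding parameter $b = 3$.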

The Waiting Hierarchy. The 'waits on' relation induces a hierarchy referred to as the waiting
hierarchy which is represented by a (collection of) directed tree(s) defined over a subset of the edges
of the tree $T$. Our protocol is designed so that if $v$ waits on $u$, moving to mode WAITING in phase $i$,
then in phases $1, \ldots, i$, $u$ was ACTIVE, and in phase $i + 1$, $u$ is either ACTIVE or COLORED. Moreover,
if $u$ is ACTIVE and $v \in N(u)$ is WAITING, then $v$ must be waiting on $u$. Note also that if $v$ waits on
$u$ and $u$ moves to mode COLORED in phase $j$, then $v$ moves back to mode ACTIVE in (the beginning
of) phase $j + 1$ and $d^{j+1}(v) = 0$.

Observation. In the beginning of phase $i$, $|C(v)| \ge \min\{d^i(v) + 1, 3\}$ for every $i \in \mathbb{Z}_{>0}$ and node
$v \in V^i$.

Proof. As long as $d^i(v) \ge 3$, no neighbor of $v$ can run Procedure RandColor, and hence no neighbor
of $v$ can move to mode COLORED. Therefore, $C(v) = \{1, 2, 3\}$ in the beginning of the first phase
$i \in \mathbb{Z}_{>0}$ such that $d^i(v) \le 2$. From that moment on, every ACTIVE neighbor of $v$ that moves to
mode COLORED decreases both $|C(v)|$ and $d^i(v)$ by 1. The assertion is completed by recalling that
non-ACTIVE neighbors of $v$ must be waiting on $v$ and hence, cannot move to mode COLORED before
$v$ does. □

Corollary 5.1. Consider some node $v \in V^i$ that runs Procedure RandColor. If $d^i(v) = 0$, then
$v$ moves to mode COLORED with probability 1. Otherwise ($d^i(v)$ is either 1 or 2), $v$ moves to mode
COLORED with a positive constant probability.

Let $\hat{V}^i$ be the restriction of $V^i$ to nodes $v$ that were ACTIVE in all phases $1, \ldots, i$; that is, $\hat{V}^i$ does
not include WAITING nodes that became ACTIVE again (recall that these will move to mode COLORED
in the next phase with probability 1). Let $\hat{F}^i$ be the forest induced on $T$ by $\hat{V}^i$. Given some node
$v \in \hat{V}^i$, let $\hat{N}^i(v) = \{u \in \hat{V}^i \mid (u, v) \in E\}$ be the neighborhood of $v$ in $\hat{F}^i$ and let $\hat{d}^i(v) = |\hat{N}^i(v)|$
be its degree. Observe that if $v \in \hat{V}^i$, then $v \in V^i$ and $\hat{d}^i(v) = d^i(v)$. Moreover, if $v \in V^i - \hat{V}^i$,
then $d^i(v) = 0$, in which case $v$ runs Procedure RandColor in phase $i$ and Corollary 5.1 guarantees
that $v \notin V^{i+1}$.

The correctness of the protocol can now be established: The logic of Procedure RandColor
implies that every output configuration is a legal coloring. Since ACTIVE leaves are removed from
$\hat{F}^i$ with probability 1 and since every tree has at least two leaves, it follows that $\hat{V}^{k+1} = \emptyset$ for
$k = \lceil n/2 \rceil$. Combining the properties of the waiting hierarchy with Corollary 5.1, we conclude that
the execution reaches an output configuration within at most $k$ additional phases. It remains to
analyze the run-time of our protocol.

Good nodes. Consider some tree $T'$. We say that node $v$ of $T'$ is good if $v$ is a leaf or if the
degree of $v$ is 2 and both neighbors of $v$ are of degree at most 2.

Observation 5.2. In every tree, at least a (1/5)-fraction of the nodes are good.
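
The definition of a good node in a tree can be checked mechanically on small examples. The function name below and the adjacency-dictionary format are assumptions of this sketch, which illustrates the definition rather than proving the observation.

```python
def good_tree_nodes(adj):
    """Leaves, plus degree-2 nodes whose two neighbors have degree at most 2."""
    deg = {v: len(adj[v]) for v in adj}
    return {v for v in adj
            if deg[v] == 1
            or (deg[v] == 2 and all(deg[u] <= 2 for u in adj[v]))}
```

On a path with five nodes every node is good, whereas on a star with four leaves exactly the four leaves are good; both are consistent with the 1/5 bound.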

Consider some $i \in \mathbb{Z}_{>0}$ and some node $v \in \hat{V}^i$. Let $T'$ be the tree to which $v$ belongs in $\hat{F}^i$.
We argue that if $v$ is good in $T'$, then $v \notin \hat{V}^{i+1}$ with a positive constant probability. Indeed, if $v$
is a leaf in $T'$, which means that $\hat{d}^i(v) = d^i(v) = 1$, then it either moves to mode WAITING with
probability 1 (if the neighbor of $v$ has a higher degree) or it runs Procedure RandColor, in which
case Corollary 5.1 guarantees that $v$ moves to mode COLORED with a positive constant probability;
if $\hat{d}^i(v) = d^i(v) = 2$ and both neighbors of $v$ in $\hat{F}^i$ (and in $T'$) are of degree at most 2, then $v$ runs
Procedure RandColor, in which case Corollary 5.1 again guarantees that $v$ moves to mode COLORED
with a positive constant probability. Since Corollary 5.1 also guarantees that nodes of degree 0 in
$\hat{F}^i$ move to mode COLORED with probability 1, we can employ Observation 5.2 and Markov's bound
to establish the following observation.

Observation 5.3. There exist two constants $0 < p, c < 1$ such that $|\hat{V}^{i+1}| \le c|\hat{V}^i|$ with probability
at least $p$.

Similarly to the analysis in Section 4, define the random variable $Y = \min\{i \in \mathbb{Z}_{>0} : |\hat{V}^i| =
0\}$. Observation 5.3 implies that $Y$ is stochastically dominated by a random variable that obeys
distribution NB$(O(\log n), 1 - p) + O(\log n)$, namely, a fixed term of $O(\log n)$ plus the negative
binomial distribution with parameters $O(\log n)$ and $1 - p$, hence $Y = O(\log n)$ in expectation and
with high probability. Since $Y$ bounds from above the depth of the waiting hierarchy, it follows
that the execution reaches an output configuration within $2Y$ phases, which completes the analysis.

Theorem 5.4. There exists an nFSM protocol that 3-colors any n-node (undirected) tree with
run-time $O(\log n)$.

6 Computational Power 

A deterministic linear bounded automaton (dLBA) is a (deterministic) Turing machine whose
working tape is restricted to the cells specifying the input (this is equivalent to a DSPACE($O(n)$) Turing
machine). A non-deterministic linear bounded automaton, a.k.a., linear bounded automaton (LBA),
is the non-deterministic version of a dLBA, and a randomized linear bounded automaton (rLBA)
is the randomized version. Kuroda [24] proved that the class of languages that can be decided
by an LBA is exactly the context-sensitive languages, corresponding to the Type-1 grammars in
Chomsky's hierarchy of formal languages [14]. Whether LBAs are equivalent to dLBAs and where
exactly rLBAs lie between the two are major open questions in computational complexity (cf.
the first LBA problem). The following two lemmas show that in terms of its computational power
(regardless of run-time considerations), an nFSM protocol is essentially equivalent to an rLBA.
Lemma 6.1. An nFSM protocol on a graph $G$ of arbitrary topology can be simulated by an rLBA.

Proof. The input for the Turing machine is the graph $G$, given as an adjacency list. In order to
simulate the execution of the nFSM protocol, we store some additional information in the entries
of the adjacency list as follows: For each node $v$, we store its current state and the next letter
it transmits. For every node $u$ in the list of neighbors $N(v)$ attached to $v$, we store the entry
of $v$'s port that corresponds to $u$. In each round of the nFSM protocol, the rLBA performs two
sweeps of the list of nodes: The first sweep serves to calculate $v$'s next state $q$ and transmitted
letter $\sigma$ for all nodes $v$ based on $v$'s current state and the messages in its ports, according to the
nFSM state machine, which is hard-wired in the rLBA. However, the calculated letter $\sigma$ is not
"transmitted" yet, so that the calculations for subsequent nodes in the list are not corrupted;
rather, it is stored in the corresponding place next to $v$. In the second sweep, for every node $v$, the
letter $\sigma$ is "transmitted", that is, the lists of neighbors are traversed, and at each occurrence
of $v$, the current letter is replaced by $\sigma$. This way, we simulate every round of the nFSM protocol.
In total, our simulation requires additional $O(1)$ space per node and $O(1)$ space per edge, hence it
can be implemented with an rLBA. The assertion follows. □
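
The two-sweep idea from the proof can be sketched in a few lines. In this illustration, `simulate_round`, the `port` dictionary, and the callback `delta` are hypothetical names; `delta(q, letters)` is assumed to return the next state together with the letter to transmit (or `None` for no transmission).

```python
def simulate_round(adj, state, port, delta):
    """One synchronous round via two sweeps over the node list.

    port[v][u] holds the last letter that v received from u.
    """
    pending = {}
    # Sweep 1: compute every node's next state and letter from the old ports,
    # buffering the letter next to the node instead of delivering it.
    for v in adj:
        state[v], pending[v] = delta(state[v], [port[v][u] for u in adj[v]])
    # Sweep 2: deliver the buffered letters into the neighbors' ports.
    for v in adj:
        if pending[v] is not None:
            for u in adj[v]:
                port[u][v] = pending[v]
```

Delivering a letter already during the first sweep would let nodes later in the list see messages of the current round, which is exactly the interference the buffering avoids.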

Lemma 6.2. An rLBA can be simulated by an nFSM protocol on a path.

Proof. Let $n$ be the number of cells in the tape of the rLBA. Then, the path network has $n$ nodes,
each corresponding to one cell of the tape, i.e., we identify a node $v$ of the path nFSM with a
certain cell on the tape. Let $\Gamma$ be the working alphabet and $P$ be the state space of the rLBA. The
nFSM protocol is designed so that the state of node $v$ indicates: (1) which letter from $\Gamma$ is written
in $v$; (2) if the head of the rLBA currently points to $v$; (3) the current state of the rLBA, which is
allowed to be incorrect if (2) is false; and (4) if the head is currently located to the left or to the
right of $v$. Hence, we fix $Q = \Gamma \times \{0, 1\} \times P \times \{L, R\}$. The alphabet of the nFSM is $\Sigma = \{L, R\} \times P$.

Suppose that the input to the rLBA is $\gamma_1 \dots \gamma_n \in \Gamma^n$. Then, we assume that the
initial state of the $i$th node in the path is $(\gamma_i, h_i, p_0, L)$, where $p_0$ is the initial
state of the Turing machine and

$$h_i = \begin{cases} 1 & \text{if } i = 1 \\ 0 & \text{if } i > 1. \end{cases}$$



Note that the distinction between the initial state of the first node in the path and the initial states 
of all other nodes is without loss of generality. Indeed, as the first and last nodes have degree 1 
and all interior nodes have degree 2, it is easy for a node to "decide" (under the nFSM model) if 
it is an interior node. Distinguishing between the first and last nodes is unavoidable if one wants
to distinguish between the inputs $\gamma_1 \dots \gamma_n$ and $\gamma_n \dots \gamma_1$.

At all times, we maintain the invariant that exactly one node is in a state in
$\Gamma \times \{1\} \times P \times \{L, R\}$ (denote this node as active), whereas all other nodes
are in a state in $\Gamma \times \{0\} \times P \times \{L, R\}$. Only the active node can transmit
messages; all other nodes remain silent and listen. If a non-active node $v$ receives a message
indicating that the head should move to the left (respectively, right), and $v$'s state indicates
that the head is currently to its right (resp., left), then $v$ becomes the active node; otherwise,
$v$ does not react to this message. Now, the nodes simulate the behavior of the rLBA by calculating
the next state of the rLBA based on the rLBA's transition function (which is hard-wired in the FSM)
and updating their own states accordingly. The assertion follows. □
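
The hand-over of the head between neighboring nodes can be sketched as follows. This is a simplified illustration under several assumptions rather than the construction itself: the machine is deterministic (no random choices), rounds are synchronous, the active node is found by a global lookup for brevity, and the transition table `delta`, the end markers `^` and `$`, and the function names are all hypothetical.

```python
def step(nodes, delta):
    """Run one round; return False once the simulated machine halts.

    nodes[i] = (letter, active, p, side), mirroring the state space Q.
    delta    : hypothetical dict (p, letter) -> (p_next, letter_next, direction)
    """
    # Locate the unique active node (a global lookup, used for brevity only).
    i = next((k for k, node in enumerate(nodes) if node[1] == 1), None)
    if i is None:
        return False  # the head left the tape; treated as halting here
    letter, _, p, _ = nodes[i]
    if (p, letter) not in delta:
        return False  # no transition defined: the machine halts
    p_next, letter_next, direction = delta[(p, letter)]
    # The active node rewrites its cell, records where the head went, goes silent.
    nodes[i] = (letter_next, 0, p_next, direction)
    # Both neighbors hear (direction, p_next); each reacts using its own state only.
    for j in (i - 1, i + 1):
        if 0 <= j < len(nodes):
            letter_j, _, _, side = nodes[j]
            if side != direction:  # the head moves toward this node
                nodes[j] = (letter_j, 1, p_next, side)
    return True


def run_on_path(tape, delta, p0, max_rounds=10_000):
    """Initialize node i to (tape[i], h_i, p0, 'L') and run until halting."""
    nodes = [(c, 1 if i == 0 else 0, p0, "L") for i, c in enumerate(tape)]
    for _ in range(max_rounds):
        if not step(nodes, delta):
            break
    return "".join(node[0] for node in nodes)


# Toy machine (an assumption): walk right to '$', then walk back left
# rewriting every 'a' to 'b', and halt on the left marker '^'.
delta = {
    ("r", "^"): ("r", "^", "R"),
    ("r", "a"): ("r", "a", "R"),
    ("r", "$"): ("l", "$", "L"),
    ("l", "a"): ("l", "b", "L"),
}
print(run_on_path("^aa$", delta, "r"))
```

The test `side != direction` is the local rule from the proof: a node becomes active exactly when the head was on one side of it and the message says the head moves toward the other side, so only one of the two neighbors ever reacts.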






References 



[1] Y. Afek, N. Alon, Z. Bar-Joseph, A. Cornejo, B. Haeupler, and F. Kuhn. Beeping a maximal 
independent set. In Proceedings of the 25th international conference on Distributed computing 
(DISC), pages 32-50, 2011. 

[2] Y. Afek, N. Alon, O. Barad, E. Hornstein, N. Barkai, and Z. Bar-Joseph. A Biological Solution 
to a Fundamental Distributed Computing Problem. Science, 331(6014):183-185, Jan. 2011. 

[3] N. Alon, L. Babai, and A. Itai. A fast and simple randomized parallel algorithm for the 
maximal independent set problem. J. Algorithms, 7:567-583, December 1986. 

[4] B. Awerbuch. Complexity of network synchronization. J. ^CM, 32(4):804-823, 1985. 

[5] B. Awerbuch, B. Patt-Shamir, D. Peleg, and M. E. Saks. Adapting to asynchronous dynamic 
networks (extended abstract). In STOC, pages 557-570, 1992. 

[6] B. Awerbuch and D. Peleg. Network synchronization with polylogarithmic overhead. In FOCS, 
pages 514-522, 1990. 

[7] L. Barenboim and M. Elkin. Distributed (delta-l-l)-coloring in linear (in delta) time. In STOC, 
pages 111-120, 2009. 

[8] L. Barenboim and M. Elkin. Combinatorial algorithms for distributed graph coloring. In 
DISC, pages 66-81, 2011. 

[9] L. Barenboim and M. Elkin. Deterministic distributed vertex coloring in polylogarithmic time. 
J. ACM, 58(5):23, 2011. 

[10] M. Bellare, O. Goldreich, and M. Sudan. Free bits, peps, and nonapproximability-towards 
tight results. SIAM J. Comput, 27(3):804-915, 1998. 

[11] Y. Benenson, T. Paz-Elizur, R. Adar, E. Keinan, Z. Livneh, and E. Shapiro. Programmable 
and autonomous computing machine made of biomolecules. Nature, 414(6862) :430-434, Nov. 
2001. 

[12] D. Brand and P. Zafiropulo. On communicating finite-state machines. J. ACM, 30:323-342, 
April 1983. 

[13] I. Chlamtac and S. Kutten. On Broadcasting in Radio Networks-Problem Analysis and Pro- 
tocol Design. Communications, IEEE Transactions on [legacy, pre - 1988], 33(12):1240-1246, 
1985. 

[14] N. Chomsky. Three models for the description of language. IRE Transactions on Information 
Theory, 2:113-124, 1956. h ttp : //www . chomsky . inf o/articles/195609 — . pdf , 



[15] R. Cole and U. Vishkin. Deterministic coin tossing with applications to optimal parallel list 
ranking. Inf. Control, 70(l):32-53, July 1986. 

[16] A. Cornejo and F. Kuhn. Deploying wireless networks with beeps. In Proceedings of the 24th 
international conference on Distributed computing (DISC), pages 148-162, 2010. 

[17] R. Flury and R. Wattenhofer. Slotted Programming for Sensor Networks. In International 
Conference on Information Processing in Sensor Networks (IPSN), Stockholm, Sweden, April 
2010. 

[18] M. Gardner. The fantastic combinations of John Conway's new solitaire game 'life'. Scientific 
American, 223(4): 120-123, 1970. 

[19] A. V. Goldberg, S. A. Plotkin, and G. E. Shannon. Parallel symmetry-breaking in sparse 
graphs. SI AM J. Discrete Math., l(4):434-446, 1988. 

[20] P. Gordon. Numerical Cognition Without Words: Evidence from Amazonia. Science, 
306(5695) :496-499, Oct. 2004. 

[21] K. Kothapalli, C. Scheideler, M. Onus, and C. Schindelhauer. Distributed Coloring in 
0{\/\ogn) Bit Rounds. In 20th International Parallel and Distributed Processing Symposium 
(IPDPS), 2006. 

[22] F. Kuhn. Weak graph colorings: distributed algorithms and applications. In Proceedings of 
the twenty-first annual symposium on Parallelism in algorithms and architectures, SPAA '09, 
pages 138-144, New York, NY, USA, 2009. ACM. 

[23] F. Kuhn, T. Moscibroda, and R. Wattenhofer. What cannot be computed locally! In Pro- 
ceedings of the twenty-third annual ACM symposium on Principles of distributed computing 
(PODC), pages 300-309, 2004. 

[24] S.-Y. Kuroda. Classes of languages and linear-bounded automata. Information and Control, 
7(2):207-223, 1964. 

[25] C. Lenzen and R. Wattenhofer. MIS on trees. In Proceedings of the 30th annual ACM SIGACT- 
SICOPS symposium on Principles of distributed computing (PODC), pages 41-48, New York, 
NY, USA, 2011. 

[26] N. Linial. Locality in distributed graph algorithms. SIAM J. Comput, 21:193-201, Feb. 1992. 

[27] M. Luby. A simple parallel algorithm for the maximal independent set problem. SIAM J. 
Comput., 15:1036-1055, November 1986. 

[28] N. A. Lynch. Distributed Algorithms. Morgan Kaufmann, 1st edition, 1996. 



[29] Y. Metivier, J. M. Robson, N. Saheb-Djahromi, and A. Zemmari. An optimal bit complexity 
randomised distributed MIS algorithm. Distributed Computing, 23(5-6):331-340, Jan. 2011. 

[30] J. V. Neumann. Theory of Self-Reproducing Automata. University of Illinois Press, Champaign, 
IL, USA, 1966. 

[31] D. Peleg. Distributed computing: a locality-sensitive approach. Society for Industrial and 
Apphed Mathematics, Philadelphia, PA, USA, 2000. 

[32] S. Plotkin. Graph-theoretic techniques for parallel, distributed, and sequential computation. 
MIT/LCS/TR. Laboratory for Computer Science, Massachusetts Institute of Technology, 1988. 

[33] D. Sadava. Life: The Science of Biology. Sinauer Associates, 2011. 

[34] J. Schneider and R. Wattenhofer. An Optimal Maximal Independent Set Algorithm for 
Bounded-Independence Graphs. In Journal of Distributed Computing, March 2010. 

[35] J. Schneider and R. Wattenhofer. Distributed Coloring Depending on the Chromatic Number 
or the Neighborhood Growth. In 18th International Colloquium on Structural Information and 
Communication Complexity (SIROCCO), Poland, June 2011. 

[36] J. Suomela. Survey of local algorithms. To appear in: ACM Computing Surveys, 2012. 
|http : //www ■ cs .helsinki . fi/u/josuomel/doc/local- survey .pdfl 

[37] M. Szegedy and S. Vishwanathan. Locality based graph coloring. In STOC, pages 201-207, 
1993. 

[38] S. Wolfram. A new kind of science. Wolfram Media, Champaign, Illinois, 2002. 



